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SUMMARY 


The  Cambridge  Project  Is  a  cooperative  effort  by  a 
variety  of  scientists,  both  at  M.l.T.  and  at  Harvard, 
to  make  the  digital  computer  more  useful  In  the  behavioral 
sciences,  basic  and  applied.  This  Technical  Report  su^na- 
rlzes  progress  during  the  half  year  beginning  In  July  of 
1971  --  l.e.,  the  half  year  Immediately  following  the 
period  covered  by  the  Project's  second  Annual  Report. 

The  Project  has  two  main  goals:  first,  developing 
programs  and  other  computing  tools  that  are  needed  In 
behavioral  science  and  Its  applications;  second,  combining 
these  tools,  and  others  borrowed  from  other  sources,  Into 
a  Consistent  System  of  programs,  models,  and  data  that 
behavioral  scientists  can  use  on-line.  In  both  of  those 
areas,  the  primary  effort  Is  devoted  to  writing  and  docu¬ 
menting  programs;  however,  lesser  amounts  of  effort  have 
been  devoted  to  theoretical  studies  of  statistical  tech¬ 
niques,  equipping  a  computer-based  laboratory  for  the 
study  of  autonomic  conditioning,  and  so  on. 

The  most  conspicuous  progress  during  the  time  covered 
by  this  report  was  In  the  development  of  the  Consistent 
System.  During  this  period  the  foundation  for  the  system  v 

went  Into  service  within  the  Multlcs  time-sharing  system 
at  M.l.T,,  and  a  library  of  programs  began  to  collect  upon 
that  foundation.  These  developments  are  described  In 
Section  6  under  five  headings:  the  Substrate,  conventions, 
utility  subroutines  and  basic  programs,  documentation,  and 
growth  of  the  library. 

Sections  2,  3,  4,  and  5  are  concerned  with  the  first 
of  the  two  main  goals,  developing  programs  and  other  com¬ 
puting  tools.  They  are  divided  as  follows: 

Section  2  Is  on  data-hand 1 1 ng,and  summarizes  progress 
In:  the  development  of  Janus,  which  Is  an  advanced  system  for  hand¬ 
ling  data  of  a  type  common  in  the  behavioral  sciences;  a 
facility  that  employs  a  set-theoretic  approach  to  handling 
behavioral  science  data;  programs  that  assist  an  Investi¬ 
gator  to  locate  within  a  collection  of  surveys  questions 
that  are  relevant  to  his  problem;  end  subsystems  for  hand¬ 
ling  nature  I - 1 enguage  text. 

Section  3  Is  on  dota-analysls,  end  summarizes  progress 
In:  basic  statistical  methods  end  algorithms;  Incorporating 
Into  the  Consistent  System  the  statistical  programs  from 
Toss,  which  was  an  experlmentel  version  of  a  consistent 
system;  Incorporating  a  group  of  Interactive  statistical 
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programs  developed  for  a  PDP-10;  Incorporating  the  Time- 
Series  Processor  system;  application  of  spectral  analysis 
to  time-series;  fitting  non-llnear  growth  models  to  longi¬ 
tudinal  data;  programs  for  analyzing  networks  of  relations 
or  transactions;  testing  the  General  Inquirer's  facilities 
for  distinguishing  between  homographs  In  analyzing  English 
text;  and  construction  of  two  thesauruses  to  be  used  In 
analysis  of  policy  documents. 

Section  4  Is  on  modeling,  and  summarizes  progress  In: 
DISCOURSE,  an  Interactive  system  for  constructing  models 
In  which  geographic  locations  are  Important;  the  Impllcator, 
which  helps  Investigators  to  transform  social  theories  from 
words  Into  computer  models;  Interactive  facilities  for 
Bayesian  logistic  regression  models;  evaluation  of  a  set 
of  algorithms  fc  modeling  social  networks;  modules  for 
probabilistic  models,  especially  decision  models;  3nalysls 
of  project-planning  networks;  validating  against  data 
from  many  geographic  sub-areas  the  simulation  of  processes 
that  extend  over  a  wider  area;  and  adapting  to  Multlcs  the 
CASCON  system  for  modeling  Information  on  local  conflicts. 

Section  5  Is  on  human  factors  In  Interactive  computing 
and  on  computer  control  of  experiments.  It  summarizes 
progress  In:  a  computer-based  laboratory  for  the  study  of 
autonomic  conditioning;  a  group  of  programs  for  manipulating 
digitized  sound  waves;  the  Interactive  Graphic  Decision 
Aid;  the  On-line  Decision-Making  Laboratory;  and  a  CRT- 
based  program  for  studying  Interaction  between  a  computer 
and  users  with  "Intuitive"  thinking  styles. 

Work  In  both  areas  --  the  development  of  a  Consistent 
System,  and  the  development  of  particular  tools  for  behavioral 
science  and  Its  applications  --  will  continue  during  the 
coming  half  year  under  the  same  contract. 
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I.  INTRODUCTION 


The  Cambridge  Project  Is  a  cooperative  effort  to 
make  digital  computers  more  useful  In  the  behavioral 
sciences,  both  basic  and  applied.  This  semiannual 
technical  report  covers  the  half  year  that  began  In 
July  of  1971,  where  the  second  Annual  Report  left  off. 

The  purposes  of  the  Project  were  set  forth  In  some 
detail  In  Part  I  of  the  first  Annual  Report,  and  were 
reviewed  briefly  in  the  second.  As  was  explained 
there,  the  Project  has  two  main  purposes:  to  develop 
computing  tools  needed  In  the  behavioral  sciences,  and 
to  assemble  such  tools  Into  a  Consistent  System  that 
the  scientist  can  use  Interactively.  The  tools  will, 
for  the  most  part,  be  programs,  but  may  also  Include 
models,  or  sets  of  generally  useful  data.  The 
Consistent  System  will  heln  ensure  that  they  can  be 
used  In  unexpected  combination  by  the  researcher  who 
may  not  be  a  programmer  In  the  traditional  sense.  We 
expect  that  In  the  Consistent  System  these  tools  will 
therefore  be  much  more  useful  than  they  would  In 
Isolation. 

Computer  scientists,  statisticians,  and  a  variety 
of  behavioral  scientists  from  both  M. I.T.  and  Harvard 
are  cooperatlnq  In  the  Project.  As  In  previous  years, 
some  60  membors  of  the  two  faculties  participated. 
They  are  complemented  bv  numerous  students  and  research 
personnel,  and  there  Is  also  a  staff  of  full-time 
enployoos  responsible  for  designing  and  constructing 
the  framework  of  the  Consistent  System,  for  orovldlnq 
somo  of  the  basic  tools  that  form  the  collection,  and 
for  coordinating  tho  documentation  of  the  rest. 

During  the  first  two  years  the  Project  was 
supported  by  the  Advanced  Research  Projects  Agency 
under  Contract  DAHCI5  69  C  0347. 
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2.  DATA  HANDLING 


The  Project’s  general  plans  In  the  area  of  data 
handling  were  detailed  In  Section  D. I  of  Part  I  of  the 
first  Annual  Report,  They  range  from  specific  programs 
for  which  there  Is  an  Immediate  need  to  general 
research  on  managing  data  of  the  kinds  that  behavioral 
scientists  use. 

Principal  among  the  specific  programs  Is  the  Janus 
system  for  Interactive  data  handling  and  analysis.  The 
overall  objectives  are  to  provide  a  language  Interface 
between  users  and  their  data,  to  perform  common 
data-preparat I  on  activities,  to  provide  formatted 
output,  to  Interface  with  other  analysis  procedures  In 
the  Consistent  System,  and  to  permit  Inspection, 
manipulation,  creation,  and  cross-ref erenc I ng  of 
datasets.  More  details  are  given  In  Section  2.1  of  the 
second  Annual  Report. 

A  prototvoo  of  Janus  has  been  working  In  Toss  (an 
exploratory  version  of  the  Consistent  System)  since 
November.  Recently,  It  has  been  modified  so  as  to  work 
within  the  Consistent  System.  Extensive  timings  and 
testing  of  the  nrototyoe  are  near  completion  and  a 
small  groun  of  experimental  users  Is  being  Introduced 
to  It. 


Feedback  from  these  activities  Is  being  used  In 
designing  the  final  version  of  Janus.  Judging  from 
results  from  the  prototvoo  version,  the  basic  design 
appears  we  I  I  suited  to  Multics  and  to  the  extended 
capabilities  planned  for  the  final  version.  Design  of 
tho  final  version  Is  well  under  way,  and  actlvo 
Implementation  will  probably  begin  In  late  soring. 

A  set- thooreti c  approach  to  data  management  Is 
also  being  explored,  A  prototype  Implementation  Is 
complete  and  Is  undergoing  performance  tests  with 
various  types  of  databases  and  queries.  A  direct 
comparison  of  this  approach  with  more  traditional 
approaches  to  data-managomont  is  being  undertaken  with 
a  vlow  to  usors'  anticipated  regu I rements . 

Becauso  large-sample  surveys  can  be  expensive, 
multiple  use  of  them  Is  desirable,  and  collections  of 
surveys  have  thoreforo  beon  established  (e.q.,  The 
Roner  Centor).  However,  Information  pertinent  to  the 
investigator’s  interest  mav  be  distributed  throuqh  many 
surveys,  and  he  may  have  difficulty  finding  It.  Some 
basic  softwaro  components  (the  data  base  generation  and 
rotr leva  I  command  language)  that  assist  In  finding 
identical  or  similar  questions  in  different  surveys 
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have  been  made  operational  with  a  subset  of  the  Gallup 
Survey  files.  The  language  allows  a  user  to  make 
queries  about  the  number  of  questions  satisfying 
certain  logical  conditions  and  then  probe  the  text  of 
the  questions  and  other  data-descrl ptor  Information. 
The  system  allows  a  flexible  definition  of  user- 
environments  that  accomodate  different  research  styles 
and  Interests.  The  present  software  Is  based  upon 
CP/CMS.  Current  work  Is  on  documentation  and  problems 
associated  with  storage  of  many  large  flies  within  the 
Consistent  System. 

In  the  area  of  text-handling,  a  number  of 
subsystems  have  been  Implemented  from  the  components 
discussed  In  Section  2.2  of  the  second  Annual  Report. 
Two  serious  difficulties  have  been  encountered.  First, 
sufficiently  accurate  tools  to  monitor  CPU  usage  were 
not  available  In  Multlcs  and  subsystem  tuning  suffered 
as  a  result.  Second,  attempts  to  provide  subsystems 
with  a  full  range  of  powers  available  In  Multlcs  led  to 
unacceptably  high  overhead.  It  appears  that  many  of 
the  features  that  distinguish  Multlcs  from  other 
operating  systems  cannot  be  used  effectively  at  present 
by  subsystems  of  this  type.  However,  valuable 
experience  In  problem  conceptualization  was  gained,  and 
during  the  next  six  months  some  programs  will  be 
transferred  to  the  IBM  370/155  at  MIT. 
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3.  DATA  ANALYSIS 


Section  D.2  In  Part  I  of  the  first  Annual  Report 
explained  that  the  Project's  general  plans  about  the 
analysis  of  data  Include  the  development  of  tools  for 
four  kinds  of  analysis:  multivariate  techniques, 
primarily  multidimensional  scaling;  analysis  of 
networks;  analysis  of  longitudinal  data,  like 
time-series;  and  analysis  of  natural  text.  In  all  four 
cases  we  are  Interested  not  only  in  the  develoDment  of 
new  methods,  but  In  the  evaluation  of  those  that 
a  I  ready  exist. 

During  the  period  covered  here,  basic  work  In 
regression  analysis  has  centered  on  an  empirical  study 
of  alternatives  to  the  ordinary  method  of  least 
squares.  Several  have  been  found  to  be  superior,  and  a 
report  of  the  findings  will  be  prepared  separately.  An 
attempt  to  fit  logistic  models  to  counted  data  with 
random  effects  Is  In  the  preliminary  stages,  and  an 
interactive  system  for  fitting  log-linear  models  to 
multidimensional  contingency  tables  has  been  specified. 
Also,  Portnoy  has  refined  his  nrocedure  for  determining 
the  dimensionality  of  non- l Inear  data  structures  by  a 
method  of  local  orlncioal  components  and  two  reports  on 
It  are  available:  "Local  Principal  Components;  A 
Method  for  Determining  the  Dimensionality  of  Non-Linear 
Data  Structures"  and  "A  Computer  Proqram  for  Testinq 
and  Apnlylnq  the  Method  of  Local  Principal  Components". 
Previous  work  in  all  these  areas  was  described  in 
Section  3.1  of  the  second  Annual  Report. 

Two  doctoral  dissertations  on  statistical  methods 
were  completed  with  contract  support.  One,  by  R. 
Heiberger,  is  a  three  part  thesis  on  statistical 
computation.  The  other,  by  B.  Rosner,  Is  on  a 
sem I -Bayes  I  an  apnroach  to  outlier  detection.  Copies  of 
each  are  ava liable. 

A  number  of  common  statistical  procedures,  about 
fifty  In  all,  are  being  Introduced  Into  the  library  of 
the  Consistent  System.  Most  of  them  are  from  the 
earlier  Toss  system  and  are  listed  in  Section  3.6  of 
the  second  Annual  Report.  They  include  frequency 
distributions;  measures  of  central  tendency, 
variability,  and  higher  moments;  parametric  and 
non-parametri c  tests;  product  moment  and  rank  order 
correlations;  and  simple  regression.  Moving  these 
procedures  from  Toss  to  the  Consistent  System  was  made 
easier  by  their  design:  the  computational  "kernel"  of 
each  of  them  was  written  as  a  closed  subroutine,  and  an 
interface  can  be  easllv  supplied  to  honor  the 
requirements  of  the  Consistent  System.  Furthermore, 
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the  kernels  can  be  used  as  components  In  one  or  another 
of  the  statistical  "packages"  proposed  for 
construction,  or  could  be  called  directly  by  suitable 
data  management  Interfaces. 

Special  efforts  are  being  directed  toward  analysis 
techniques  that  take  advantage  of  the  Interactive 
environment  of  Multlcs.  A  collection  of  such  programs. 
Including  many  basic  statistical  programs  as  well  as  a 
newly-designed  program  for  Interactive  factor  analysis, 
is  being  readied  for  the  Consistent  System.  The 
collection  was  originally  written  In  FORTRAN  on  the 
PDP-IO.  It  Is  now  almost  completely  debugged  and,  so 
far  as  possible,  all  system-deDe ndent  features  are 
being  removed  In  {preparation  for  adapting  It  to 
Multlcs. 

In  the  area  of  longitudinal  methods,  TSP  (the 
Time-Series  Processor)  has  been  adapted  to  Multlcs  and 
Is  being  Introduced  Into  the  Consistent  System.  TSP  Is 
a  sizeable  system  of  FORTRAN  programs  aimed  primarily 
at  the  analysis  of  economic  time-series.  It  was 
designed  for  batch  processing,  and  modifications  that 
will  exploit  the  Interactive  possibilities  of  Multlcs 
are  being  studied. 

Several  applications  of  spectral  analysis  to 
economics  havo  been  explored.  (As  an  example,  a 
significant  effect  of  the  price  of  capital  on  national 
Investment  has  been  found  for  components  around  the 
business  cycle  frequencies.)  A  set  of  spectral 
programs,  based  on  estimation  using  the  fast  Fourier 
transform  algorithm,  are  being  tested  nrlor  to 
Implementation  In  TSP;  these  will  compute  (cross-) 
perlodograms  and  smoothed  spectral  estimators  using 
"prewhitening"  and  alignment.  A  report  Is  In 
preparation  detailing  computation  of  regression 
estimators  using  only  part  of  the  spectrum  --  Important 
In  economics,  wh«»  •»  It  Is  desirable  to  purge  a 
regression  of  Its  seasonal  or  other  frequency 
components. 

Several  programs  for  longitudinal  analysis  aimed- 
at  the  validation  of  non-l Inear  growth  models  have  also 
been  developed  and  documented.  The  programs  and  their 
descriptions  are  available. 


The  program  SOCPAC,  which  analyzes  the  triad 
structure  of  sociometric  networks,  has  been  modularized 
with  a  view  toward  bringing  the  modules  onto  Multlcs 
within  the  Consistent  System,  and  a  set  of  notes  Is 
being  prepared  on  Statistical  Modeling  of  Network  Data. 
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RANULL,  a  program  for  analysis  of  transaction 
flows,  and  DICHOT,  a  program  for  3  hierarchical 
di chotoml zatl on  of  transaction  matrices  by  the 
McQu I tty-CI ark  method  of  Iterative  l ntorco I umnar 
corre I  at  Ions ,  are  being  transferred  to  Multlcs  for 
Inclusion  In  the  Consistent  System.  It  Is  expected 
that  RANULL,  which  uses  the  "null  model"  cf  Savage  and 
Deutsch,  will  be  Integrated  with  two  newly-written 
programs,  DECISION  and  RWC_POWER.  The  first  of  these 
Is  a  simple  programming  of  the  Day-T I nney* mode  I ;  the 
second  ci  leu  I  ales  Dahl’s  "power"  Index  and  Chadwick’s 
"control"  Index.  A  fuller  discussion  Is  given  In  a 
report  by  Richard  W.  Chadwick:  "Power,  Control,  Social 
Entropy,  and  the  Concept  of  Causation  In  Social 
Science".  Previous  reports  on  related  work  w^re  listed 
in  Section  3.10  of  the  second  Annual  Report. 

As  part  of  the  work  on  the  General  inquirer,  a 
large  effort  to  provide  subroutines  that  disambiguate 
English  homographs  Is  nearing  completion.  (See  Section 
3.14  of  the  second  Annual  Report.)  Test  runs  indicate 
93 %  success  In  Identifying  the  correct  sense  in  a 
sample  of  *70  difficult  English  words.  Supporting 
programs.  Including  a  retrieval  specification  format  In 
pseudo-PL/i  that  is  converted  into  real  PL/ I  via  the 
pre-compiler,  are  now  In  final  stages. 

in  a  parallel  effort  In  the  analysis  of  text,  two 
thesauruses  are  being  constructed.  An  expert  on  the 
domestic  and  forelqn  oolltlcs  of  the  USSR  has  developed 
concent  hierarchies  and  the  beginnings  of  a  thesaurus 
for  Sovlot  documents.  A  thesaurus  for  American 
documents  Is  being  constructed  automatically  from  text 
taken  from  "The  New  York  Times".  A  report  entitled 
"Data  Sets  for  the  Development  of  a  Document  Encoding 
and  Retrieval  System:  A  Description  of  Computer 
Readable  Texts  and  Queries"  by  Rosemarie  Rogers  Is 
available.  For  earlier  work,  see  Section  3.13  of  the 
second  Annual  Report. 


4.  MODELING 


Th©  general  plans  for  tools  tor  building  and 
evaluating  models  were  described  In  Section  D.3  of  Part 
I  of  the  first  Annual  Report. 

Several  new  features  have  been  added  to  DISCOURSE, 
which  was  discussed  In  Section  4.1  of  the  second  Annual 
Report.  it  Is  an  Interactive  computer  lanquaqe  for 
addressing  problems  In  which  geographic  parameters  are 
Important.  'The  lanquaqe  has  been  modified  so  the 
syntax  of  a  DISCOURSE  command  now  accomodates  use  of 
non-Dl SCOURSE  functions,  general  expression  evaluation, 
and  recursively  Imbedded  strings  of  characters. 
Efficiency  of  operation  has  been  Improved  by  hashing 
all  names  through  a  single  table  that  Includes  synonyms 
end  by  providing  a  preprocessed  modo  that  allows 
preinterpretation  of  commands. 

The  basic  Input  and  retrieval  programs  for  the 
General  Impllcator,  which  was  discussed  in  Section  4.2 
of  the  second  Annual  Report,  are  running  reliably. 
Current  work  Is  focused  on  building  out  of  those  basic 
programs  macro-programs  for  special  applications  —  for 
example,  a  program  that  will  Identify  causal  or 
Inferential  chains.  Also,  the  present  Implementation 
In  PL/I  Is  being  compared  with  poss I bl 1 1 1 los  In  list 
processing  and  other  spocial  lanquages. 

Tite  Bayesian  logistic  regression  model  described 
In  Section  II.D.7  of  the  first  Annual  Report  Is  oelnq 
fitted  Into  an  on-line  data  analysis  system  that 
emphasizes  Bayesian  methodology.  This  Is  a  step  toward 
Introducing  It  Into  the  Consistent  System. 

Some  evaluation  of  the  social  network  models 
described  In  Section  4.6  of  the  second  Annuel  Report  have 

been  made  In  an  attempt  to  Identify  those 
cha  acterlst I cs  of  the  networks  which  have  predictive 
and  descriptive  value.  A  number  of  routines  have  been 
developed  In  the  APL  language  to  search  for  such 
character  I st I cs  as  cllnues,  clusters,  cycles,  semigroup 
homomorph I  sms ,  subgraph  isomorphisms,  and  graph 
automorphisms.  The  results  have  been  disquieting,  and 
there  Is  some  question  about  which  of  the  routines 
should  be  Included  In  the  Consistent  System. 

A  set  of  tools  for  probabilistic  models,  decision 
models  In  particular,  which  were  described  In  Section 
4.7  of  the  second  Annual  Report  are  a  step  nearer 
completion.  The  primary  effort  over  the  past  six 
months  has  been  on  debugging.  During  this  period, 
however,  the  system  has  been  used  for  graduate  and 
undergraduate  research  projects.  Considerable 
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roprogramml nq  and  documentation  efforts  are  bolng  done 
In  preparation  for  Introducing  these  programs  Into  the 
Consistent  System. 

In  tho  area  of  Project  Network  Analysts*  described 
In  Section  4.g  of  the  second  Annual  Report,  four 
Interactive  comnuter  nrograms  have  now  been  devolopod: 
a  proqram  for  creating  and  editing  a  network;  a  program 
for  analysis  of  a  network  In  terms  of  physical 
structure  and  other  oarametrrs;  a  nrooram  which  uses 
some  of  the  outnuts  of  the  previous  program  to  predict 
the  result  of  n  particular  scheduling  algorithm;  and  a 
program  that  uses  one  of  five  alnorlthms  to  produce  a 
schedule  for  the  network.  In  addition*  hatch  versions 
of  two  of  tho  programs  have  been  deve'onod.  A  manual 
describing  the  oroqrans  Is  In  preparation  by  E.  W. 
Davis. 


Simulation  tochnlauos  for  small  geographic  areas* 
described  In  Section  4.||  of  the  second  Annual  Report* 
have  made  progress  on  three  fronts.  A  streamiing  of 
data  reduction  programs  Into  a  sequence  of  well- 
defined  steps  has  been  accomplished.  A  "room" 
capability  has  been  added  In  the  mapping  routine  to 
permit  examination  of  a  single  area.  Finally,  a 
"tuner"  Is  nearing  completion.  It  will  take  a 
theoretically  sound  model  that  approximates  the  dcta 
reasonably  and  will  adjust  the  model  to  Its  ultimate 
accuracy. 

Finally*  support  has  been  given  to  an  effort  to 
adapt  to  Multlcs  the  CASCfWI  svstem,  the  Computer-Aided 
System  for  Information  on  local  Conflicts. 
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5.  HUMAN  FACTORS  AND  CONTROL  OF  LXPERI MEMTS 

Gennrn I  olens  for  the  davaloomant  of  mtthods  fnr 
computer  control  of  txpprlmants  were  detailed  In 
Section  0.4  of  Pert  I  of  the  first  Annuel  Report,  while 
reseerch  on  human  factors  In  Interactive  computer  usage 
was  discussed  In  Section  E.3. 

The  computer-based  fecIMty  for  study  of  autonomic 
behavior  is  now  In  use  and  was  described  In  Section  5. 1 
of  the  second  Annual  Report.  Experiments  are  being 
done  on  asslstlr.q  people  In  gaining  volurtarv  control 
over  their  autonomic  activity  --  e.g.,  heart  rate. 

The  presumption  Is  that  If  people  can  control  their 
autonomic  erousal  under  stress,  their  ability  *o 
perform  perceptual  and  motor  fasts  under  stress  will 
Improve.  Specialised  hardware  and  software  have  been 
developed  tot  (I)  train  people  to  onln  control  of  heart 
rate  for  a  short  time,  <2)  train  thee  to  become 
Increasingly  accurate  In  assessing  their  autonomic 
arousai,  and  (3)  test  perceptual  performance  under 
stress.  The  Immediate  conclusion  Is  fh#t  some  people 
can  gain  control  of  heart  rate,  and  that  thev  can 
exerclst  the  control  to  Improvm  their  performance. 
Future  work  will  bo  concerned  with  Individual  I  tine  the 
training  process  so  that  most  people  can  learn  to 
control  heart  rate  *or  lone  periods  of  time. 

A  Conversational  Audio  Signal  Manipulator  (CAS*<| 
Is  being  developed  on  ths  RDF-3T  In  the  Computer  Rated 
Laboratory  o>  the  Harvard  Psychology  Department,  C*SM 
has  three  nartsi  an  Interpretive  component,  which 
decodes  the  user’s  teletype  requests  to  transform  audio 
signals  and  move  them  between  storage  devices;  a 
special  dish  directory  scheme  for  referencing  stored 
signals;  and  a  battery  of  frenstormet Ion  routines. 

Development  wo rt  continued  on  the  Interactive 
Graphic  Decision  Aid  which  Is  helno  programmed  on  a 
POP- 10  computer  with  an  LOS-I  display,  and  was 
described  In  Section  5,3  o*  the  second  Annuel 

Report,  The  system  exploits  computer  oraphlcs 

technology  In  order  to  apply  statistical  decision 
theory  as  a  practical  tool  to r  daclslon  mate's. 
Preeeretlons  ere  underway  tor  observlno  the  effect  of 
such  an  aid  on  a  real  decision  maVer, 

The  On-Line  Dec  Is  Ion-Met  I ng  laboratory,  discussed 
briefly  In  Section  3.4  of  the  second  Annuel  Report,  Is 
no*  operating.  The  new  simulation  model  with  which  the 
flayers  Interact  Is  In  the  final  stages  of  debugging 
end  the  Information  recording  programs  are  operating  on 
a  POP- 10.  in  March  tha  model  will  be  fun  with 

over  TOO  graduate  students  participating,  and  two  trial 
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tests  of  the  Information  gathering  system,  with  150 
students  participating  In  each,  will  be  conducted 
during  this  period.  At  present  the  Information 
gathering  system  Is  passive!  It  records  the  time  and 
nature  of  decisions  »nd  the  patterns  of  use  of  on-line 
models.  The  ne*t  stese  Is  to  develop  active  cues  to 
allow  the  decision  meters  to  he  euestloned  about  causes 
of  unusual  activity, 

A  version  of  a  COT-basmd  program  to  Study  the 
software  elements  that  facilitate  “Intuitive" 
nan—  arh '  re  Interaction  has  been  implemented  and 
revised,  it  was  described  In  Section  5.6  of  the  second 
Annual  Oanort,  Oaf  a  on  subject  performance  If  In  the 
process  o*  collection  and  evaluation. 
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6.  THC  COH5I5TCMT  SYSTCM 


Cue  of  the  mejor  «i©els  of  the  Cambridge  Project  t3 
to  fora©  Info  e  "Consistent  System"  tools  such  »s  those 
described  In  the  precedin')  sections.  The  System  »e S 
discussed  In  detell  In  Chapter  f,  of  the  second  Annual 
Piport,  especially  In  Sections  (,l  «n<  6.}.  It  It  to 
be  *  co1  lection  of  n-odeltj  daft,  end  program*  for  use 
by  the  no.tprooramnl n©  fetMt)*rtl  scientist.  If  the 
Innovative  lnv#stlcefon  wants  *o  confine  the  fools  In 
son#  novel  wav  to  meet  Ms  speelel  needs,  he  should 
not  need  e  programmer  t©  help  hie.  »*#  should  be  able 
tc  introduce  his  o*n  defe  Into  the  system  end.  It  he  Is 
#n  enefeur  prosr  eetwer ,  he  should  he  able  to  Introduce 
his  0*n  pr  ©nwyny  f©©j  n  mover ,  his  nrefrsms  end  data 
should  wots  ©racetullv  with  those  eireedy  In  the 
col  lection 

*n«  framework  •*»©*  holds  the  collection  *©oe*ner 
consists  off  e  Sut'S  f  r  »fe«  ehlf.h  Is  a  MCtatt  o* 

routines,  spmewhpt  ll»«  an  onerefJnf  system*  on  ehlch 
the  pr^r/w m  l«  the  collection  depend)  •  set  ©t 
convention*  She?  problems  end  d*fe  ere  ••nyeted  »e 
Obey  |  end  sMnd#rd*  *®r  Oocueentef  Ion.  I>»a|as  the 
pes*  nelf  yeer  unit  o*  the  efforts  of  the  Project  ***** 
he*e  been  dev©f«d  to  prowldlnf  thet  *r#n*w©rs  #*d  i^e 
of  the  more  fenden-er f *1  pf©>sr*ey  eny  subroutines  fhet 
«l f  ef thin  If, 

the  projections  *b«»f  profress  In  building  e 

freneeprs  thet  were  n,*fe  si*  months  *0©  *n  the  second 

Annuel  *e«'Or»  proved  rem*rs*Mp  etturete.  A  ^vtiitHi 

Itffl  on  the  ttwlllcs  *lme«fher|nf  s*|t*n  ef  *,  1,1,  wfl 
(etflttff  essentially  #n  schedule.  In  spit#  of  •ft  fee* 
thef  Vhbfe  »*ne.  Ineel  table,  a  taw  patio*  chenfes  elo«* 
the  way  a  small*  se*  O*  utility  su*r©yf|nwt  they  * 
prosreni  •  *  *  *0  h#nfle  files.  fdiiMh* eiltet#  elfh  the 

Untra,i,  e‘0.,  U  reefy,  end  so  1%  *  smell  5#ttf  ©# 
propmni  form  f  vy  nucleus  ©•  the  collection  •  the 

presteft  neaped  ?#  test  other  tr«ff#«|.  the 
conventions  fhes  esnecy  ere  rp#s«rfM»  eel  I 

established,  fh#**P#  they  need  *0  I*  ClflMfl  ♦»* 
fe»%tb*e#  «•  m**e  then  eester  t# r  pr#©r  areoiar*  f# 
un#ers»*nf,  Gocomentef  le*>,  *l  predicted,  Ms  !•**#* 

behind, 

the  rts»  of  this  settlor  el  1 1  »Mse  Mbits 

In  mpre  fvlall. 

the  •»  the  Ctmslstenf  System  %l«*  upc#  « 

sotfeere  »«yndi»ien  v«ew*.  #i  the  "Jsiiffif#*,  the 
Substrate  series  #s  en  Infer  *ece  M*ee#»  program*  In 
the  Consistent  Iff  too  **f  IN  fff'flcs  pferetihb  system 
••  MblNN  In  the  Consistent  System  ##  n§t  yell  *#«* 

»l 


Mgltlca  directly.  The  Substrate  It  the  oneretlno 
syaten  fron  ,1  pronrnn'a  point  of  vleej  of  course,  the 
InpienontAtlon  of  the  Subttrnfe  Cilia  ht*vl ly  upon 
Mg  I tic*. 

Th'»  SubafrAte  hey  three  ®i»Jor  pgrpoaest  It  buffer^ 
profjrpr.a  AOAlnat  Inst Abl 1 1 ? to*  or  chenoea  la  the 

Mgitlcr.  operntlno  ayafenj  It  la  4eslyp#d  to  be 
Imp  #4  on  hoata  other  th#p  Mgitlesi  entf  It 

provldea  e  ngmber  o*  nervtcea  that  ere  peeked  by  a 
collertlon  of  thla  Mnd,  but  ere  no*  AvAllAbl#  In  ell 
th«*  opAratlno  ayatema  that  nlpht  eventual  fy  aerve  ea 
hosts. 


the  lira*  lob,  InaulAtlno  nroo^Ans  egalna*  fhenjjes 
In  the  npe*»ti«o  system,  la  streleht *ore#r<,  The 
SwSsatrete  ees  an  realynesl  thy*  a®»||  e*>«*1*l  In  Baltic* 
can  be  c©®ren« *te4  *or  Internally,  preaerelny  the 
erternei  a*eei  *  lc«t Iona  t®  pr©yr#ns  in  the 

collection,  A  pr®®r#e  cnntecta  the  operetln®  avaten 
only  lr<f Irec'lv,  throe##  the  end  thr®y®h  the 

•rwn«!lne  Sehport*  of  the  pr©3r*e«ln*| 

In  ehlc#  If  e*s  erltfen,  At  the  moment,  e*pilclf 
aepfor*  la  rr®*l®e«  *er  the  ANSI  l#«tv*9«l 

**l/l  t*t  Contact  elf#  the  t»itlet  operetlno 

ays*#©,  then,  la  iln.tef  to  their  r***tlee  p#ct#®#%  t*i| 
to  the  5«*tfr*!e,  ehlc#  ytteenfa  to  *%e  only  those 
oerta  o*  *fy  |  tic*  t  he  t  ere  eo%*  lllely  to  re«*#ln  rtejle, 
pr l*clp*l ly  the  *her Poore  *ef *r*l apr *  ft*  /0 

%ra*eo. 

To  eec«on|iah  It*  t*tfht  *ren?#er  l« 

Other  oe&blnea,  *►-#  4eO|frefe  does  *ot  eehe  *se  ©» 
ih®te  ie*f«r*a  ©t  h«lfUa  thy*  ere  lllely  if  |e  mltflnn 
»r#e  other  oferellno  irtV«n,  Infeef  #  deal**  ||*l  eta 
imp leechttyl  1 1  ty  ©n  ays*eea  #a  tl«*Mt  ea  t*eiflf%  enf 

0*%/K4,  A»  e  conaeoyeec#,  ao**  fe*f%*«r*  of  ffyi  f  I  Cl  nr# 
t®  proiree-s  In  t*e  coMertlpn  cpoplefely) 
Others  ere  f  r  ®« I  Oe-p  hy  f®##  el  thin  the  £efafr#te 
liltlt,  It  the  iy%atr#te  la  Implement#?  ®o  e«#tter 
*#!*,  fh**  the  n.ejerity  •*  pr#*reea  In  thy  Ct*tl|l«*t 
iylfen  tfh  r*n  on  thef  nee  e.ethlne,  *f*«*f#Ify  they 
eft  <f  he*e  to  h*  #efe»*|  Ulj  i«  the  m«  ne.hSIne  epyl? 
in*  to  atPh*r»  the  l»*tm«  In  #hi€*  t*e 

prooreea  #r#  erltten  » «  #1/1  •  |  »©r  the  time 

b#l*t, 

the  *htrp  Jpp,  ee *•*  f l*|  fh*  a*r*l«et  ft  the  he**l 
OP#* e tiny  system,  l|  the  w*|»  cnellenyley,  (Ifll  n*j§r 
|e**lcea  #re  ®**#e»<feef  to  In  t*e  (oiler «i©n, 

line  o*  thee  ere  elreefy  f*|el#*(  ly  #*l*lei,  bet  nr  t 
ly  Ot*ef  POtpnflel  he-,*!. 
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(I)  A  flit  tyifa*  It  guarantaad  tharaln  f  1 1  •  na*at 
have  tvo  part*)  ona  If  tha  nana  of  a  dlraetory, 
and  tha  othar  If  a  naoa  to  fa  loohad  up  In  that 
dl ractory, 

(21  Thl«  Ilia  tytta*  If  aatandad  to  covar  all  ttla 
davlcat,  aotantlally  Including  off-tlna  Nik  taaa 
floral,  tor  ava*9ta. 

(J)  irlfh  aach  Ilia  na*a  thara  It  racordad  a 
Catenation  Schana  Coda.  thlch  Indlcataf  tha 
aanaral  tyaa  and  format  ft  tha  tlla. 

14)  tilth  aach  flla  ns*  >»  thara  ara  racerdad 
lecatart  far  t«o  taparata  tactlant  at  tha  flla* 
and  (optionally)  an  antry  pol*t  to  a  propraa 
ataaclatai  with  tna  fllai  wtvtlly  ona  faction 
centainf  tha  data  ^a*tai*«i  and  tha  athar 
contain*  a  da  relation  at  tha  data  and  tnalr 
la, a,,  a  ***aehlna-raa«#fU  c«(»taok*l. 

It)  *  *tf«cfad,‘  a«»lronnant  It  provldnd  ae  that 
can  call  athar  irrfrmt  hraafh  tha 
Sniff  rata)  tha  cal  lad  prasrao  nay  call  vthtrt,  and 
ta  an,  fa  alnatt  ana  dar*th, 

tit  a  **-*•» » r #*% I  a* l©n  !«)■•  If  rratliaf  ta  that 
pratrant  can  canfral  tha  aa#n I  nff  n#**t  at  1 1  ha*a 

fa  rraaraat  f try  call. 

Ill  a  ttclilfy  tar  ef*itia  I/O  aiiaat  a 

ta  (Itirf  acft«i  nmnai  any  Input 
MtHlf)  t#f*  »  t*»ar«ln«fa  prp*ryw  «a  Iffal*, 

1*1  *  arnyy  ««  rai*t*«  pr**r*«%  ««»  ta  dHMT  art#  ta 
t*  a  *taallp*j  atanatar  a  a*  tha  tyally  It 

Ctllad,  tha  Stiff r af a  ptff  #11  HH«  #nf  ##r»l*) 
tpacat  tat*  #*  ft*  i#tt 

at  tha  faally  laM  m«, 

fat  lapdanant ♦* la*  a*  »»•  pratanv  5*%tfr#ta  a#t  a 
.pala#  tattaara  *<tp#«  i f  it#  tf*l«t  ff#»f.  If  If 
a#  f»#n  In  #1/1,  (ftyrhat  4W*  #(«%»  !»*»)#*(  Ilaat  •* 
tatrea  («(d,  tM  Incltf'at  »«*•  U  a#tar**|  calif  and  #n 
tffltlanal  Infarnal  #a«ft*a«,  ^ca  lapla*n#nta« t#n 

I*)**,  Ch#*sat  *t  tha  #*|ta*i  ##»f  §rat  I  *tih)|y  *aa  #*t 

tha  lap iaaa«f a* la*  ){%»f«|«  *a%  Mf  •♦ttfly, 


(;W*tMliai...  It#  ca***h«lt*t  f*a*  pr#f##at  any 
data  ft*  a«r«c*ad  H  l|ll»t  ara  fhf  !•(««!  **«*»  ptata 
at  it*  ifHatifi,  t»»y  can  t<#  dl*l##d  tn*a  inr##  *.»§* 
fratrti  ai#t*  data,  ittri  »n> 

nay  pra»fr§at  «##  cal  tat  #*f  ft#  fl*«M  »#•?  *l»  *#♦#* 
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and  conventions  about  transfer  to  other  boats. 

The  conventions  about  data  cantar  around  tha 
Description  Schamas.  Each  Ilia  ot  data  carries  with  It 
a  (Description  $cba«a  Coda  (which  It  racordad  with  tha 
name  cf  tha  Mia  In  tha  dlractory).  Retort  using  a 
file,  a  program  oust  cheeb  tha  Coda  to  saa  whether  tha 
1 1  i a  Is  ot  a  torn  It  can  handle,  and  «h«n  creating  a 
1 1  l a «  a  program  m  at  supply  tha  corract  Coda  tor  It. 

It  a  contributor  to  tha  collection  tlndt  that  non# 
ot  tha  tornafs  specified  by  e*lstlng  schenes  meet*  tha 
needs  ot  tha  programs  ha  Intends  to  write,  ha  nay 
datlna  a  new  scheme,  and  tha  adelnlsf refors  ot  tha 
collection  will  assign  an  arbitrary  Coda  to  It.  that 
Cod#,  and  the  specification  et  ype  new  scheee,  than 
becone  a  permanent  part  o*  tha  docunantaf Ion  ot  tha 
System,  In  theory  there  If  nothing  to  Seen  ts#  number 
ot  fchanef  term  nr*>|  I  *er#t  I  ng  Indefinitely!  however. 
It  will  usual Iv  be  easier  for  a  programmer  to  u»*  some 
Scheme  that  already  waists.  V#  hope  that  that  fact 
will  ten#  to  aewp  c» ntrlbutnrs  iron  defining  nee 
schemes,  ©r  that  socle  and  edml nl % f reflve  pressure 
will  at  le*st  *trt«act  then  to  provide  programs  thet 
translate  #a»*  to  and  irgt  schemes  that  rra  already 

popular. 

the  cgnwantlr  s  *POu*  programs  are  Centered  ero*n# 
the  I  #e  a  that  a  program  >%(tf||  he  able  to  act  es  the 

user’s  agent  In  running  otter  pregrant  tor  Mo.  In 

general,  «hep  Insist  mat  programs  be  called  In  a 

«nl  torn  oynn»r,  #nr  that  each  pro«,**m  be  a  neat,  tidy 
modal#  ehnse  If *a*lor  an  agent  can  control  e«%l |y.  In 
particular,  they  'acted*  sue*  provisions  #%  ftesei 

then  tne  Sebftrate  ?w*%  a  program  la*  the 
IrsMgttlOh  o»  •  *yh  Program  call  from  an  #«©* ner 
progrynl.  If  assumes  the  c*> led  program  will  ha*#, 
iaraggwlcelly,  the  tore  ot  a  *1/1  Subroutine  with 
#  parti c*l *f  »#'*  I*  calling  seguence. 

Qaw  91  fa*  *r gomes t%  1*  the  calling  segment*  If  a 
String  g*  Characters  that  arw  *d 1 r ef *  I ens*  Vg  the 
Callef  nf#g#*n,  the  called  prp.gr an  |%  responsible 
•or  <#*c*#lag  these  dlr*rtin*%  IfseM,  #nf  It  oust 
ret%rn  an  errgr •e«ssag#  fO  the  user  In  some 
#r<perly  **f  l»  are  unintelligible  tf  It, 

StprmeMy,  a  prggrem  appeals  "O  »h# 
name»»f  a*f!af  l#n  fable  fg  discover  mg  true 
omanlngs  g»  the  **f*s  0#  any  files  fhp  llflfjlptl 
tell  I*  fO  lns?»C»  «'  Create.  fM$>  p«rol«%  pn 
agent  «g  Cfnfrgl  the  meanings  that  mpegs  el  1 1  ae*e 
*0  f r«*gr**s  IV  runs  tgr  the  user. 
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If  a  orogrom  send)  messages  to  of  asks  for  Inputs 
from  the  user's  terminal.  It  must  do  so  throuqh 
Channels  nn  aqent  proqram  cen  Intercept,  so  that 
the  nnenf  will  bo  able  to  conduct  the  conversation 
In  the  user's  stead. 

Only  very  special  programs  are  permitted  to  ask 
the  Substrate  to  Interfere  with  the  program 
■stack"  that  was  mentioned  above  as  one  of  the 
services  the  Substrate  guarantees. 

Final ly,  the  conventions  that  are  designed  to  make 
the  collection  easier  to  transfer  to  other  hosts 
prohibit  the  use  o*  those  features  of  PL/I  end  FORTRAN 
mat  are  not  likely  to  be  evellaole  on  other  hosts, 
fhoy  also  provide  that  only  special  programs  may  use 
certain  •Irregular*'  features  of  the  Substrate  that 
probably  cannot  be  duplicated  In  precisely  the  same 
form  on  ai l  hosts. 

The  conventions  have  been  shaken  down  In  the 
process  of  codlno  the  utility  subroutines  and  basic 
programs  that  will  be  mentioned  below.  They  see«  to  be 
In  reasonably  good  shape,  although  If  Is  now  evident 
that  they  need  to  he  codified  and  reworded  to  make  them 
eeclstr  tor  programme rs  to  understand. 

1  1 1  fy  Ssihrmut  lees  «ng  b«|lc  programs.*-  As  the 
actual  cans  to  the  Substrate  are  »v  design  oulte 
primitive,  they  are  supplemented  by  utility  subroutines 
whlc?  eilow  ppoaramml  og  languages  like  FORTRAN  to  more 
easily  call  upon  the  Substrate  tor  service.  Rhlle 
these  are  of  little  Immediate  concern  to  the 

eenpregramml ng  behavioral  scientist  who  would  simply 
use  the  system,  they  are  of  value  to  those  who 
Incidentally  have  programming  skills  and  a  desire  to 
write  Consistent  System  programs.  Standardised  uti¬ 
lity  subroutines  make  programs  aaslar  to  write  and  tnsurt 
a  moderate  level  of  consistency  la  outward  bthavlor  and 
duality  Internally,  They  are  elto  Important  from  the 
point  o«  view  of  economy i  It  Is  not  nece-tery  that  Indi¬ 
vidual  programs  contain  the  code  needed  to  perform  the 
functions  these  Subroutines  perform. 

To  gate,  a  variety  gt  utility  subroutine*  have  been 
provided  which,  among  other  things.  Parse  the  string  of 
directions  with  which  a  program  we*  Invoked,  resol  we  a 
Symbolic  *lle  name,  open  **vwrel  tiles  at  once,  find  a 
given  element  In  an  array,  ana  allow  a  graceful  return 
jn  the  event  #•  some  catastrophe, 

*  snail  grove  *>t  service  nrogr,an%  has  also  been 
provid'd  t#  let  "ha  user  Inspect  definitions  In  his 
directory.  Inspect  tt»e  g»ana  of  programs,  end  a 

session,  ang  j«r tgra  Other  essential  actions  pt  that 

S«bf*. 
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More  Important,  from  the  view  of  the 
nonprogremml nq  behavioral  scientist,  It  tha  way  In 
which  ha  can  exercise  a  program  from  hit  ronsola. 
Thara  It  a  command  procassor  or  "bnylc  cal  lor"  notabla 
for  Ita  almost  nonaxlitant  syntax  and  relatively 
trivial  organization  -  qualities  which  Insuro, 

respect  I va I y :  freedom  to  design  a  command  with  nearly 
any  syntax  deemed  appropriate  by  Its  author;  and  the 
ability  to  replace  the  "basic  caller"  with  a  more 
specialized  command  translator.  Perhaps  more 

Interesting  Is  a  "macro-command"  facility  that  allows  a 
sequence  of  orespeclfled  commands  to  be  executed  as  a 
logical  unit;  the  execution  It  Interpretive,  and 
meta-statements  are  provided  to  control  the  flow  of 
execution  and  parameterize  the  Inputs  to  Individual 
commands . 

The  service  programs,  the  basic  caller,  end  the 
macro  Interpreter  constitute  a  nucleus  around  which  the 
library  can  grow.  They  are  the  programs  a  programmer 
needs  to  test  others  that  he  will  contribute  to 
col  lection. 

Documentation.—  Because  the  Consistent  System  Is 
being  devised  with  long  term  use  as  a  goal, 
documentation  Is  of  paramount  Importance.  In  time,  an 
enormous  number  of  proqrams  wl 1 5  have  to  be  used 
fluently  by  a  large  population  of  behavioral 
scientists,  and  the  programs  will  have  to  he  maintained 
by  generations  of  programmers.  Only  so  I  f-sustal nl ng 
documentation  will  keep  these  efforts  less  than 
Herculean.  This  past  six  months  has  seen  a  review  of 
the  documentation  needs  of  the  project  and  Its  user 
community,  and  planning  for  their  fulfillment.  The 
experience  accumulated  In  documenting  the  Toss  system 
was  helpful  In  this  regard;  details  about  It  are  to  be 
found  In  Section  6.2  of  the  second  Annual  Report. 

Our  plans  call  for  two  general  categories  of 
documents;  tutorials  for  a  general  understand! ng,  end 
reference  manuals  for  specific  Information  about  a 
particular  routine  or  file  of  data.  The  tutorial 
manuals  will  be  relatively  static  once  produced,  but 
the  reference  documents  will  continually  accumulate  new 
sheets  for  Insertion  as  new  contributions  come  Into  the 
system. 

The  tutorial  category  can  be  further  subdivided 
Into  e  general  Introduction  for  users  of  the  Consistent 
System,  Introductions  for  particular  classes  of  users 
or  about  particular  subsystems,  end  a  programmar’s 
Introduction  to  the  Consistent  System. 
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The  reference  documentation  will  tend  to  be  more 
piecemeal ,  more  concerned  with  Individual  programs, 
subroutines,  and  sets  of  data.  It  will  also  be  more 
uniform:  there  will  be  standard  formats,  and  authors 
will  be  urged  to  stick  to  them  If  at  all  possible. 
Reference  material  will  separate  Itself  Into  two  tyoes 
—  that  distributed  widely,  and  that  kept  In  archives 
at  the  Project  headquarters  to  be  distributed  on 
demand,  a  copy  at  a  time.  The  latter,  less  visible 
documentation  Is  typically  used  for  maintenance  and 
modification.  We  assume  that  routines,  data,  and  the 
documentation  on  them  will  be  used  much  more  often  than 
they  will  be  changed,  and  so  It  Is  reference 
documentation  for  use  that  Is  the  more  widely 
dl strl buted. 

Every  name  that  appears  In  a  public  directory 
whether  the  name  of  a  program,  data-flle,  or 
combination  thereof  --  Is  represented  by  one  or  more 
sheets  that  comprise  the  user’s  reference  manual.  The 
purpose  of  these  sheets  Is  to  describe  the  external 
apoearance  of  the  entity  In  standardized  language 
understandable  to  the  class  of  users  for  which  It  Is 
Intended,  For  programs,  the  descriptions  must  be  exact 
enough  to  enable  a  user  to  use  the  programs 

aporopr I  ate  I y  In  a  "macro-command”.  Learning  how  to 
write  the  descriptions  that  go  Into  this  book  Is  one  of 
the  most  Important  challenges  the  Project  faces.  The 
description  must  both  be  complete  and  be  comprehensible 
to  the  users  forwhom  It  Is  Intended;  when  the  users  are 
not  programmers  In  the  traditional  sense,  satisfying 
both  demands  Is  difficult. 

For  those  users  who  are  programmers  and  want  to 
write  programs  or  subroutines  for  themselves  or  for  the 
public  collection,  appropriate  reference  material, 
different  from  that  described  above,  must  be  provided. 
It  must  Include  Information  on  public  subroutines,  on 
Description  Schemes  that  have  been  officially 
registered,  on  the  behavior  of  the  Substrate,  and  on 
conventions  a  program  must  follow  If  It  Is  to  be 
admitted  Into  the  public  collection. 

For  the  same  audience  there  Is  on  on-line  register 
of  external  symbols.  This  register  lists  all  external 
symbols  used  by  each  program  In  the  public  library,  as 
well  as  all  symbols  that  have  been  reserved  for  future 
use.  As  presently  conceived,  this  register  Is 
I nterrogatlve  and  will  provide,  on  demand,  a  symbol  or 
symbols  guaranteed  not  to  conflict  with  those  already 
In  use  or  reserved  for  future  use. 

This  register  Is,  In  part,  an  asset  to  good 
documentation  practice;  It  Is  also  the  first  In  a 
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class  of  control  procedures  that  will  ultimately  rank 
with  the  Substrate,  conventions,  and  documentation  as  a 
fourth  part  of  the  framework  of  the  Consistent  System. 
Additional  procedures  must  be  worked  out  for  program 
acceptance  and  Installation,  quality  assurance,  and 
ma I ntenance . 

Growth  o f  the  I  I brary.--  Of  course,  the  real  power 
of  the  Consistent  System  for  the  nonprogramming  user  Is 
In  the  spectrum  of  programs  It  makes  available  to  him. 
By  summer  the  collection  should  be  rich  enough  so  that 
a  hardy  researcher  will  be  able  to  make  practical  use 
of  It  In  some  kinds  of  work.  The  programs  that  are  to 
be  included  have  been  mentioned  In  previous  sections 
and  the  following  Is  only  a  brief  overview. 

Statistical  procedures  that  are  expected  to  be 
available  include  all  the  standard  parametric  and 
nonparametr I c  hypothesis  tests  like  t-tests  and 
F-tests.  Chi-square,  tau,  product-moment  and  rank-order 
correlations  will  also  be  available.  A  newly 
developed.  Interactive  factor-analysis  program  Is 
expected,  as  are  the  multiple  linear  regression  and 
ANOVA  (analysis  of  variance,  with  covariance  added) 
programs  which  are  now  parts  of  DATA-TEXT. 

Additional  data  analysis  tools  expected  Include 
the  TSP  (Time  Series  Processor)  package  developed  for 
the  econometrician,  and  two  transaction  flow  analysis 
programs,  "RANULL"  and  "D1CH0T".  Covering  those 
Instances  where  specifically  designed  tools  are 
lacking,  a  set  of  matrix  oDerators  will  be  nrovlded 
enabling  a  behavioral  scientist  to  oursue  more 
individualistic  analytic  methods.  We  expect  the 
"macro-command”  facility,  described  above,  to  make 
these  pursuits  easier.  The  principal  tool  for 
data-management  and  data-edltlng  will  be  the  prototype 
version  of  the  Janus  subsystem. 

During  the  coming  half  year  the  library  Is 
expected  to  enter  a  p’jriod  of  rapid  growth  as  programs 
from  Toss  are  transferred  to  it,  and  as  other  programs 
that  the  participants  In  the  Project  have  been 
preparing  begin  to  accumulate  within  It. 
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